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ON  THE  DUALITY  BETWEEN  FOINTS  AND  LINES 
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Southern  Methodist  University  University  of  Missouri 
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ABSTRACT 

Duality  between  points  and  lines  is  examined,  and  the 
error  distributions  corresponding  to  various  forms  of  duality 
are  simulated.  Several  different  least  squares  procedures  for 
fitting  of  a  line  to  several  points  are  dualized  to  the  fitting 
of  a  point  to  several  lines  and  then  dualized  again  to  give  the 
coordinate  of  a  point  of  intersection  of  several  lines. 

1.  INTRODUCTION 

Duality  appears  in  the  statistical  literature  in  several 
different  forms,  but  none  of  these  forms  appear  to  be  the  same 
as  here.  For  example,  the  term  duality  is  used  for  the 
interchange  of  the  role  of  X  and  Y  in  multiple  correlation 
problems  by  Khatri  (1964),  who  notes  that  Bartlett  (1939)  had 
first  used  this  terminology.  Duality  is  used  extensively  in 
linear  programming.  This  type  of  duality  has  been  exploited  by 
Narula  and  Wellington  (1977a  and  b)  to  obtain  the  minimum  sum 
of  weighted  absolute  errors  in  regression.  It  was  also 
exploited  by  Pinski  and  Sposito  (1976)  and  by  Adriano  (1977)  to 
prove  that  normal  equations  are  consistent.  It  is  used  bv  Book 
(1982)  for  least  absolute  deviations  position  finding. 
Armstrong,  Elam  and  Hultz  (1977)  use  duality  when  dealing  with 
a  two-way  classification  model.  Mardia  (1472)  has  a  book  on 


V?  r.Tf.^  vjv;-^;  t#v  ■* 


the  statistics  of  directional  data  which  deals  with  some  of  the 
topics  considered  in  this  paper,  but  in  a  different  manner. 

The  von  Mises  distribution,  which  is  used  for  errors  around  a 
circle,  is  tabulated  by  Gumbel,  Greenwood  and  Durand  (1953) 

Still  another  form  of  duality  exists  in  statistical  designs  and 
has  an  extensive  literature.  See,  for  example,  Cinha  and  Dey 
(1983).  Another  form  of  duality  exists  in  sample  survey  work. 
See  Sirken  and  Casady  (1982),  or  Lepkowski  and  Groves  (1989), 
or  Sr ivenkataramana  (1980).  Noorbaloochi  and  Meeden  (1983) 
dualize  unbiasedness  and  Bayes. 

Duality  of  points  and  lines  will  be  treated  in  this  paper 
as  problems  in  which  lines  are  represented  by  coordinates 
and  points  are  represented  by  equations.  We  will  focus  on 
problems  of  position  finding,  starting  with  those  considered  by 
Daniels  (1951),  Beale  (1961),  Rosenblatt  (1978),  and  Hsu 
(1979).  A  recent  paper  by  Souvaine  and  Steele  (1987)  exploits 
duality  between  points  and  lines  and  refers  to  some  earlier 
work  on  the  subject  which  was  not  known  to  the  authors  of  this 
paper  when  this  was  written. 

2.  DUALITY  BETWEEN  LINES  AND  POINTS 

Consider  the  equation  of  a  straight  line  in  the  form 
y  =  mx  +  b ,  in  the  (X,Y)  plane.  We  could  just  as  well 
represent  this  line  in  a  (U,V)  plane  by  the  coordinates  (-m,  b). 
Similarly,  we  could  think  of  the  point  (xj,  yj)  in  the  (X,Y) 
plane  and  write  an  equation  for  it  of  the  form  v  =  -xju  +  yj  in 
the  (U,V)  plane.  Hence,  there  is  a  one-to-one  correspondence 
between  points  and  lines  in  the  (X,Y)  plane  and  lines  and 
points  in  the  dual  (U,V)  plane.  Note  that  the  common  point  of 
intersection  of  several  lines  y  =  m^x  +  b^  in  the  (X,Y)  plane 
corresponds  to  the  common  line  connecting  the  points  (-m^,bj) 
in  the  (U , V )  p lane  . 

Now  consider  the  usual  model  in  regression  where  y  =  mx  + 
b  +  e  and  e  is  normally  distributed  with  mean  zero  and  variance 
02.  We  have  a  sample  ( x  j , y  j ) ,  (x2,y2)»  •••>  (xn,yn)  and  b  and 

m  are  estimated  by 
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A 

m 


-  x)(y.  -  y) 

E(x  .  -  x)2 
1 

b  =  y  -  mx 

The  estimator  of  the  regression  of  Y  on  X  i s  y  =  mx  +  b . 

Suppose  now  that  we  are  interested  in  obtaining  an  estimator  of 
the  common  point  of  intersection  of  several  lines  y  =  m^x  +  bj, 
i*=l,  n.  The  equations  of  the  lines  are  transformed  into 

coordinates  as  outlined  above,  so  that  our  data  consist  of 
(uj  >V|) ,  (u2 ,  V2 ) ,  (un,  vn)  where  uA  =  -114  and  v^^  =  bi  . 

We  invoke  the  usual  procedures  to  compute  the  regression 
in  the  (U,V)  plane  and  obtain  v  =  mu  +  b ,  where  m  and  b 
are  "the  same  as"  m  and  b  above  except  that  U's  and  V's  are 
substituted  for  the  X's  and  Y's.  Then  transforming  this  line 
back  to  the  (X,Y)  plane  we  have  the  coordinates  of  the 

A 

estimated  common  point  of  intersection  to  be  (-m,  b).  (Note 
the  change  in  sign  on  the  m.) 

The  question  arises  about  the  error  structure  of  this 
procedure.  In  the  (X,Y)  plane  we  assume  the  form  y^  =  mxj  +  b 
+  for  i  =  1,  2,  ...  n  where  the  EjS  are  independent.  Hence, 
if  we  use  the  same  formulas  in  the  (U,V)  plane  we  assume  that 
v^  =  mu^  +  b  +  for  i  =  1,  2,  ....  n  where  again  the  e^'s  are 
independent  realizations  of  e  which  are  normally  distributed 
with  mean  0  variance  0^.  An  estimator  of  is 

A  2  1  n  a  *  o 

0  =  -  I  (v^  -  mu;  -  b)2. 

n-2  i=l 

A 

We  know  that  (m,b)  are  jointly  bivariate  normal  with 
means  (m,b)  and  va r  iance-covar i ance  matrix 
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Hence,  we  can  construct  confidence  intervals  for  m  and  b 


(the  coordinates  of  the  point  of  intersection  of  n  lines)  as 


and  where  t tt / 2  is  the  (l-a/2)  quantile  of  the  Student 
t-di s t r ibut ion  with  n-2  degrees  of  freedom. 


These  could  then  be  used  to  give  us  a  confidence  region 
in  the  (X,Y)  plane  for  the  point  of  intersection  we  seek.  Thi 
confidence  region  would  be 


A ' .  * '  -  »'  •*„  at.  ■(.  a1 


at  at. 


til.  >a).  laL'KLSt.M  'it. 
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£  2s2F2,n-2(1-a)rnaOO  • 

Hence,  with  probability  1  -  a  we  can  be  sure  that  m  and  b  lie 
in  this  region,  where  F2  n_2(  1— a.)  is  the  (l-a)th  quantile  of 
the  F-distribution  based  on  2  degrees  of  freedom  for  the 
numerator  and  n-2  degrees  of  freedom  for  the  denominator. 

In  the  (U,V)  plane  we  are,  however,  minimizing  the  sum  of 
squares  £(v^  -  muj  -  b)2  with  respect  to  m  and  b,  i.e.,  we  are 
assuming  the  u^'s  are  obtained  without  error  while  the  v^'s 
contain  all  of  the  error.  In  the  (X,Y)  plane  this  means  that 
the  slope  of  each  line  is  without  error  while  the  intercept,  b, 
is  associated  with  all  of  the  error.  That  is,  we  are 
minimizing  the  sum  of  the  squares  of  deviations  measured 
parallel  to  the  y-axis  of  each  line  from  the  point  with 
coordinates  (m,b).  Hence,  the  process  we  are  using 


minimizes  £  (b-  mu.  -  v.)  with  respect  to  b  and  m  and  gives 

i-1  1  1 

the  solution  (m,  b).  In  the  (X,Y)  plane  this  corresponds  to 
taking  the  vertical  distance  from  the  point  to  be  found  to  each 
line  and  minimizing  that  sum  of  vertical  distances  squared. 

For  example,  if  we  have  six  lines 


Equation  of  line  Coordinates  of  line 


y 

=  50.5 

(  0 

» 

50 

5) 

y 

=  46.8 

(  0 

> 

46 

8) 

y 

=  1  .  5x 

+  62.3 

(-1 

5, 

62 

3) 

y 

=  1  .  5x 

+  67.7 

(-1 

5, 

67 

7  ) 

y 

=  3x  + 

80. 1 

(-3 

y 

80 

1) 

V 

=  3x  + 

76.2 

(-3 

y 

79 

2) 

Note  that  for  this  example  the  sample  lines  are  parallel  in 
pairs.  Nevertheless,  we  want  to  estimate  a  common  point  of 
intersect  ion . 

Transforming  to  the  (U,V)  plane  we  get  m  =  93/9  = 

10.3333  and  b  =  48.9333,  or  in  the  (X,Y)  plane  the  point  of 
intersection  is  (-10.3333,  48.9333). 

In  the  solution  above,  for  a  common  intersection  of  n 
lines  we  are  assuming  that  m  is  obtained  without  error  for  each 
of  the  lines,  but  that  b  (the  intercept)  is  subject  to  error. 

Therefore,  in  the  (X,Y)  plane  we  can  be  95%  sure  that  the 
point  of  intersection  is  inside  the  ellipse  given  by: 


or 

(x+10.33)2  -  0.8(x+10.33)(y-48.93)  +  -  (y-48.93)2  S  3.5174 

15 

or  alternatively,  we  can  have  confidence  intervals  on  m  and  b 
as  follows.  We  can  be  95%  confident  that  m  lies  between 

-  10.333  1  2  776  (2.387)  -  or  from  -12.5421  to  -8.1245 

7q 

For  b,  we  can  be  95%  confident  that  it  lies  between 

48.933  t  2.776  (  2.387  )  --  — or  from  44.656  to  53.210. 

yr  yr 

3.  SIMULATION  RESULTS 

To  demonstrate  the  efficacy  of  the  procedure  in  Section  2, 
we  simulated  the  coverage  of  the  above  confidence  intervals  in 
two  cases  where  the  number  of  lines  is  (a)  30  and  (b)  4.  The 


ioint  of  intersection  is  (10,10)  and  0=1.  We  computed 
ifidence  intervals  or  regions.  The  tables  give  the 
age  of  actual  coverage. 


i)  Number 

of  lines 

is  30 : 

ia  1  s 

d  Seed 

S lope(m) 

Intercept ( b ) 

Joint(m,b ) 

) 

7373 

93 

95 

88 

) 

35757 

88 

93 

87 

) 

76373 

93.5 

88 

89 

) 

87197 

91.5 

88.5 

89.5 

i)  Number 

of  lines 

is  A : 

i  a  1  s 

d  Seed 

S  lope(m) 

Intercept ( b ) 

Joint (m, b ) 

I 

3517 

87 

89 

88 

l 

3579 

90 

88 

88 

i 

55373 

93 

88.5 

91.5 

5579 

90.5 

89.5 

88. 5 

hese  results  agree  with  the  90%  confidence  target  value, 
eed  is  the  seed  used  in  the  1MSL  routine  GGNML  which  was 
this  simulation  to  generate  normally  distributed  errors 


FIGURE  1. 

ESTIMATED  INTE 

(XO.YO)— ( !0, 10} . 5 ICMA* 1 ,#t I 
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FIGURE  1. 

ESTIMATED  INTERSECTION  POINTS 

( XO ,  TO )  =  ( 10. 10>.SIGMA=1.#1 ,#!R IALc200,DS££D=55J7J 


FIGURE  8. 

ESTIMATED  INTERSECTION  POINTS 

XO.YOM  10.  J0).5IGUA»1.#UNE>-<l#TRIAL-200.0SEE0»5579 


where  k  =  (SSyy  -  SSxx)/2SSXy  and  the  sign  of  SSXy  is  chosen 
for  the  -  sign.  This  has  been  extended  to  the  multivariate 
case  by  Fuller  and  Amemiya  (1984).  Here 


2 

SS  =  ly 

yy  i 


ayJ2 

n 


SS  =  LX2 

XX  1 


(Ex.  )2 
1 


n 


SS 

xy 


Lx .  y , 

it 


Ex.Ey . 

l  i 


n 


Then  b  is  found  from  b  =  y  -  mx . 

In  the  problem  of  Section  2 
SSVV  =  983.79333 

^uu  =  ^ 

SSUV  =  -93  . 

Hence  , 

k  =  -5.2408 

or 

m  =  10.5762 

then 

A 

b  =  v  -  mu  . 

=  64.4333  -  10. 5762( 1.5) 

=  48.5690. 

The  coordinates  of  the  point  of  intersection,  assuming  that  sum 
of  squares  of  distances  from  a  point  to  the  lines  is  minimized, 
become 

(-10.5762,  48.5690) 


in  the  (X,Y)  plane,  which  should  be  compared  with  (-10.3333, 
48.9333)  which  was  obtained  by  ordinary  least  squares  (OLS). 


5.  THIRD  SOLUTION  BASED  ON  PLUCKER  LINE  COORDINATES 

Let  us  also  consider  the  lines  written  in  the  form 


It  is  in  this  form  that  the  distance  from  a  point  to  a  line  is 
calculated.  Therefore,  if  the  sum  of  squares  of  distances  from 
a  point  to  various  lines  is  to  be  minimized  we  will  assume  the 
equation  of  a  line  is  written  in  the  form 

ux  +  vy  +  1  =  0 

which  is  the  form  for  the  Plucker  coordinates  of  projective 
geometry . 

Then  the  Plucker  coordinates  for  the  lines  in  the  example 
of  Section  2  become 

(0,  -0.019802) 

<0,  -0.0213675) 

(0.024077,  -  0.0160154) 

(0.0221566,  -  0.014771  ) 

(0.0374532,  -  0.0124844) 

(0.0378788,  -  0.0126263)  . 

The  least  squares  line  here  is 

v  =  0.21 29882u  -  0.0204991 
or 

-  1 0 . 390 1 24 u  +  48.78262v  +  1  =  0  . 

Hence  the  intersection  point  is 
(-10.3901,  48.7826). 

h.  INTERCHANGING  ROLF.S  OF  X  AND  Y 

For  both  the  ordinary  least  squares  solution  and  the 
Plucker  line  coordinate  solution  above,  we  can  interchange  the 
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role  of  X  and  Y  and  obtain  two  additional  solutions.  Note 
that,  the  second  solution  above  is  symetric  in  X  and  Y  so  the 
solution  is  the  same. 

In  the  case  of  Section  2,  this  amounts  to  the  error  being 
on  the  slope  while  the  intercept  is  measured  exactly.  Since  X 
and  Y  are  merely  interchanged,  the  roles  of  U  a..J  V  are 
interchanged.  More  explicitly,  the  observed  data  consist  of 
(u],v|),  (U2.V2),  •••,  (un,  vn)  where  uj  is  the  observed  slope 

which  is  subject  to  error  and  v^  is  the  observed  intercept 
which  is  now  exact.  Consider  the  model 

Uj  =  cjvf  +  c0  +  Ei  i  =  1  ,  2,  .  .  .  ,  n 

where  the  e  ^  ’  s  are  independently  and  normally  distributed  with 
mean  zero  and  variance  0 2.  By  interchanging  the  roles  of  X  and 
Y,  we  simply  mean  that  we  regress  {uj}  on  { v ^ }  by  the  above 
model  and  obtain  the  regression  line 

u  =  cjv  +  cQ  , 

where  cj  and  cq  are  OLS  estimators  of  cj  and  eg, 
respectively.  And  then  we  can  transform  this  regression  line 
in  the  (U,V)  plane  back  to  the  point  in  the  (X,Y)  plane,  and 
obtain  the  point  of  intersection  from 


The  points  of  intersection  for  the  example  of  Section  2  are 
given  in  the  following  table  for  all  the  error  structures. 

Regress  ion  of  Y  on  X  Regression  of  X  on  Y 


OLS  : 
Fill  ler  : 


Plucker  : 


(-10.3333,  48.9333) 
(-10.5762,  4B.5690) 
(-10.3901,  48.7826) 


(-10.5784,  48.5657) 
(-10.5762,  48.5690) 
(-10.7325,  48.3540) 


The  error  structure  for  the  Plucker  coordinates  is 
difficult  to  characterize,  but  it  is  interesting  to  see  the 
results  in  this  example. 

7.  RELATIONSHIP  TO  OTHER  APPROACHES 

The  problem  considered  here  is  directly  related  to  work 
by  Daniels  (1951)  and  others  and  summerized  in  Mardia  (1972). 

In  this  other  approach  position  finding  is  characterized  by 
several  angular  readings  whicn  follow  a  circular  normal 
distribution.  In  Section  2  above,  the  error  was  on  the 
intercept  and  in  Section  6  (where  X  and  Y  were  interchanged) 
the  error  was  on  the  slope.  We  can  think  of  the  slope  as  tan  0 
where  0  is  the  angle  a  line  makes  with  the  horizontal  axis.  We 
might  then  ask  what  the  distribution  of  0  looks  like  if  tan  0 
is  normally  distributed  with  mean  zero  and  deviation  one.  This 
is  a  simple  problem  to  solve  but  surprisingly  we  find  that  the 
distribution  of  0  for  -  ti  2  i  0  ;  tt  2  has  two  modes  at  -  n  -  k . 

In  fact,  the  density  looks  almost  uniform  over  the 
interval  -1.25  £  0  £  1.25.  Data  from  this  distribution  would 
be  hard  to  distinguish  from  data  from  a  uniform  distribution. 

8.  CONCLUSION 

The  ideas  in  this  paper  are  easily  extended  to  three  and 
more  dimensions.  In  fact,  they  may  be  even  more  practically 
useful  in  these  higher  dimensions  where  the  alternative 
approach  as  given  by  Mardia  (l‘?72)  and  others  has  not  been 
extended.  If  we  think  in  terms  of  trying  to  locate  objects  in 
space  or  beneath  the  surface  of  the  earth,  (sav,  the  focus  of 
an  earthquake),  it  is  clear  that  the  duality  should  be  between 
planes  intersecting  the  earth  and  points  in  space.  This 
extension  to  higher  dimensions  will  be  considered  in  another 


paper  . 
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